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Abstract 

In this article we investigate on the convergence of the natural iteration method, a numerical 
procedure widely employed in the statistical mechanics of lattice systems to minimize Kikuchi's 
cluster variational free energies. We discuss a sufficient condition for the convergence, based on the 
coefficients of the cluster entropy expansion, depending on the lattice geometry. We also show that 
such a condition is satisfied for many lattices usually studied in applications. Finally, we consider 
a recently proposed general method for the minimization of non convex functionals, showing that 
the natural iteration method turns out as a particular case of that method. 
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I. INTRODUCTION 



The cluster variation method (CVM) is a powerful approximate technique for the sta- 
tistical mechanics of lattice systems, which can improve the simple mean field and Bethe 
theories, by taking into account correlations on larger and larger distances. It was first 
proposed by Kikuchi in 1951 |l| as an approximate evaluation of the thermodynamic weight 
of the system, and since then it has been reformulated several times 0, 0, 0], mainly to 
clarify the nature of the approximation and to simplify the way to work it out. Quite a 
recent formulation j^J shows that the CVM consists in a truncation of the cumulant entropy 
expansion. Each cumulant is associated to a cluster of sites and the truncation is justified 
by the expected rapid vanishing of the cumulants upon increasing the cluster size. In this 
way the CVM can be viewed as a hierarchy of approximations, each one defined by the set 
of maximal clusters retained in the cumulant expansion, usually denoted as basic clusters. 
If pairs of nearest neighbor sites are chosen as basic clusters, the CVM coincides with the 
Bethe approximation. Generally, using larger basic clusters improves the approximation, 
even if the convergence of the cumulant expansion to the exact entropy has been rigorously 
proved just in a few cases 

Due to its relative simplicity and accuracy, the CVM is widely used in every kind of 



statistical mechanical applications, to determine both thermodynamic properties 
and phase diagrams l| [ 
Monte Carlo simulations 



. The CVM results generally compare well wit 




Baa 



i those of 



14 



15| . Making 



as well as experimental ones 
use of suitable series of CVM approximations, it is also possible to extrapolate quite accurate 
estimates of critical exponents Q^BQ. Recently, it has been shown that the belief 
propagation algorithm, an approximate method for statistical inference, employed for a lot 



of technologically relevant problems (image 
correcting codes j^,0|; machine learning 



20 



and signal processing 2l|, decoding of error- 



22|), is actually equivalent to the minimization 
of a Bethe free energy for statistical mechanical models defined on graphs j^. This fact 
has opened new research areas both to the application of the CVM as an improvement of 
the approximation j^J , and to the analysis of efficient minimization algorithms 24 , 2^|, 0] , 



mainly due to the fact that belief propagation sometimes fails to converge. 

Let us introduce the problem from the CVM point of view. Once the approximate 
entropy (and hence free energy) for the chosen set of basic clusters has been obtained, one 
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has to face the problem of minimizing a complicated non-convex functional in the basic 
cluster probability distributions. An algorithm for minimizing such a functional has been 
proposed by Kikuchi himself j^, an d is known as natural iteration method (NIM). A proof 
of convergence of this algorithm has been given in the original paper, essentially for the 
Bethe approximation, which can be easily extended to the Husimi tree [2^. Nevertheless, 
the range of convergent cases seems to be much wider, so that the natural iteration method 
might be interesting also for the non conventional applications mentioned above. 

In this article we analyze a sufficient condition for the convergence of the NIM. Such a 
condition is a requirement on the coefficients of the cluster entropy expansion (obtained from 
the cumulant expansion through a Mobius inversion 4)]) and is shown to hold for quite a large 
variety of approximations that are generally used to treat thermodynamic systems. Namely, 
we consider: a set of "plaquette" approximations on different lattices jsl Q, 13 > Kikuchi's 



B and C hierarchies for the square [30| and triangular |3l( lattices, the cube approximation 
for the simple cubic lattice. As far as the latter case is concerned, we actually analyze a 
generic hypercube approximation on the hypercubic lattice in d dimensions, showing that 
the sufficient condition holds for d < 3. Finally we take into account a recently proposed 
algorithm for the minimization of the CVM free energy |25| , which allows several alternatives, 
depending on the possibility of upperbounding the free energy with convex (easy to be 
minimized) functions. We show that one of the best choices is actually equivalent to the 
natural iteration method. 



II. THE CVM FREE ENERGY 

As mentioned in the Introduction, the approximate CVM entropy can be written as a 
linear combination of cluster entropies |4| 

S = 1 ^2a a S a , (1) 

a 

where the sum index a runs over all basic clusters and their subclusters. We shall always 
consider clusters in this set only. The cluster entropies are defined as usual 

S a = - ^2Pa(x a ) hgp a (x a ), (2) 

X(x 

where p a (x a ) denotes the probability of the configuration x a for the cluster a, the sum 
runs over all possible configurations, and the Boltzmann constant k is set to 1 (entropy is 
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measured in natural units). The coefficients can be determined recursive 
basic clusters down to subclusters, making use of the following property |4 ] 



y, starting from 



a'Da 



1 Va. 



(3) 



Due to the fact that a basic cluster 7 never contains (by definition) another basic cluster, 
from the above formula we immediately get a 7 = 1 V7. Here and in the following, 7 denotes 
basic clusters. As far as the hamiltonian is concerned, we assume that it can be written as 



a sum of contributions h~ from all basic clusters as 



n 



E 

7 



(4) 



where of course x 1 denote basic cluster configurations. Let us decide to write the whole CVM 
free energy as a sum over basic clusters, splitting entropy contributions from each subcluster 
among all basic clusters that contain it (in equal parts). Assuming energies normalized to 
kT, we obtain 



7 ^7 



hjiXj) + logp 7 (2 7 ) + ^ b a logP 7 (^a) 



0C7 



where 



P 7 ( 



X n 



(5) 



(6) 



C 7 \c 



Let us notice that we have defined new coefficients b a = a a /c a , where c a denotes the number 
of basic clusters that contain a, and we have expressed subcluster probability distributions 
as marginals of basic cluster distributions, according to Eq. © (the sum runs over configu- 
rations x 7 \ a of the basic cluster 7 minus the subcluster a). 



III. THE NATURAL ITERATION METHOD 



In the above formulation, basic cluster distributions {p 7 (x 7 )} are the variational pa- 
rameters of the free energy (which is denoted in short by i^fp]), and the thermodynamic 
equilibrium state can be determined by minimization with respect to these parameters with 
suitable normalization and compatibility constraints. By compatibility we mean of course 
that marginal distributions p 1 {x a ) must be the same for all basic clusters 7 D a. Let us 
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notice that, for most thermodynamic applications, one usually makes some homogeneity as- 
sumption on the system, and this generally reduces the problem to only one or few different 
basic cluster distributions. Compatibility constraints may be still necessary to impose the 
required symmetry. We go on with the complete formulation, without loss of generality. The 
important thing is that in any case we deal with constraints that are linear in the probability 
distributions (compatibility), possibly with an additive constant (unit) term (normalization). 
According to the Lagrange method, we transform the constrained minimum problem with 
respect to {p 1 {x 1 )} to a free minimum problem for an extended functional which depends 
on additional parameters (Lagrange multipliers). Due to linearity, the extended functional 
can be written in the form 

F[p, A] = F[p] - $> 7 (x 7 )A 7 (a; 7 ), (7) 

7 

where {A 7 (a; 7 )} are the Lagrange multipliers. Of course, {A 7 (a; 7 )} are not all independent 
variables, but internal relationships are system dependent, and we do not analyze them. 
Let us only notice, for future use, that the difference between the new functional and the 
original one (the last term in Eq. (J7J)) is actually independent of the {p 7 (x 7 )} distributions, 
provided they satisfy the required constraints. 

The derivatives of F with respect to p 7 (x 7 ) turn out to be 

jr- -r 4 = /i 7 (x 7 ) + logp 7 (x 7 ) + ^2 ba logp 7 (x a ) - A 7 (x 7 ) + const., (8) 

where the additive constant is irrelevant and we can absorb it into the Lagrange multipliers. 
Setting the above derivatives to zero resolves stationarization with respect to probability 
distributions. The natural iteration method consists in rewriting such equations in a fixed 
point form, that is 

p 7 (x 7 ) = e *i(*7)-M*7) J] [ Py (x a )}- K , (9) 

and then solving them by simple iteration. A new estimate of the basic cluster probability 
distribution p 7 (a; 7 ) is obtained from the previous one p 7 (x 7 ) trough its marginals p 7 (x a ). 
The Lagrange multipliers must be determined at each iteration, so that also j3 7 (x 7 ) satisfies 
the required constraints. This job can be done in different ways by a nested procedure (inner 
loop), for instance a Newton- Raphson method or a suitable fixed point method 0,1^4. In 
this paper we do not deal with the determination of Lagrange multipliers, but we only focus 
on the convergence of the main loop. 



IV. SUFFICIENT CONDITION FOR THE CONVERGENCE 



As usual for iterative algorithms designed to minimize functionals that are bounded from 
below, a proof of convergence can be given by the decreasing of the functional value at each 
iteration. This is actually the case for the natural iteration method. Let us consider the 
free energy difference F[p] — F\p\ for two subsequent iterations p,p, where F[p] is defined 
by Eqs. (0) and ©• Taking the logarithm of both sides of Eq. ©, we can rewrite the NIM 
equations in two different ways, that are 

logp 7 (x 7 ) = A 7 (s 7 ) - /i 7 (a; 7 ) - E &a logp 7 (x a ) (10) 

^2 b » ^gp 7 (x a ) = A 7 (s 7 ) - h^Xj) - logp 7 (x 7 ). (11) 

Let us replace the former into F[p] and the latter into F\p\. Remembering that probability 
distributions satisfy the constraints, whence latter term on the right hand side of Eq. (JZJ) 
depends on Lagrange multipliers only, we obtain 

m - m = E E ^ £#r - ^ w E^ lo sf#yV ( 12 ) 

Let us consider the inequality log £ < £ — 1 , observing that equality holds if and only if £ = 1 . 
By applying this inequality to the first logarithm (the one involving basic cluster probability 
distributions) in Eq. (|T2j). and taking into account that distributions are normalized, we 
obtain 

m - F[ P ] < - e E w E h « lo s ItH' (13) 

where equality holds if and only if p 7 (x 7 ) = p 7 (x 7 ) V7,x 7 . The same result could be 
obtained by observing that actually the upperbounded terms coincide with (minus) the 
Kullbach-Liebler distances between the probability distributions p 1 (x 1 ) and p 7 (x 7 ). If all 
subcluster coefficients b a were negative, we could apply the same argument to all terms, 
and the upperbound would be zero. Such a situation occurs for instance in the Bethe |23l 
and Husimi tree [2£( approximations, and the proof of convergence would be complete. In 
a general case we have to require a condition on the b a coefficients. The basic idea is 
to "couple" smaller cluster terms with a positive coefficient to larger cluster terms with a 
negative coefficient, yielding a sum of "negative" Kullbach-Liebler distances (some between 



conditional probability distributions), which can then be upperbounded by zero. The details 
are given in the following. 

Theorem (sufficient condition for the convergence): Let {b a -\ a +} be a set of non 

negative coefficients (allocation coefficients), defined for each pair of subclusters a~,a + , 
such that b a - < 0, b a + > 0, and a~ D a + . If the following properties hold for all basic 
clusters 7 

b a + = Yl ba ~\ a+ Va+ C 7 ( 14 ) 
-b a - > &ct ~l Q+ Va~ C 7, (15) 



a+Ca" 



then 



F\p]-F\p]<0 (16) 
F\p]-F\p]=0 p = p. (17) 

Eq. (jlfij) means that the free energy can be decreasing or constant during the procedure, while 
Eq. ()17p assures that it is constant only if the procedure has already reached convergence 
(i.e., the free energy can only decrease during the procedure). A relevant consequence of 
Eq. ()17|) is that it prevents the dynamical system defined by the NIM equations from having 
limit cycles at constant free energy, which could occur in principle. 

Proof: Let us consider the right hand side of Eq. ()1HJ) and split the sum over subclusters 
a C 7 in two sums over subclusters a + , a~ with positive or negative coefficients respectively. 
Positive coefficients b a + can be replaced by Eq. (fTIjl. while, according to Eq. ([T5]). negative 
coefficients can be replaced by 

b a - = - Y b a-\a+~d a -, (18) 

a+ C«~ 

for certain d a - > 0. Defining, for each a~ D a + , the conditional probability distributions 

P-y(X a -\X a +) = ~ , (19) 



after some simple manipulations we obtain 



«C 7 a) a-C 7 



P<y{X a - ) , \ r 7 i „ P-y \ X a~ \X a + ) 
d a - Wg ^7 r + 2^ ha ~ \ a+ l0 S ^7 i \ 

P-y{X a - ) P^\X a -\X a +) 



(20) 



The logarithm inequality log£ < £ — 1 can now be applied to all terms in the previous 
equation, because all coefficients are positive (or equivalently we get a sum of Kullbach- 
Liebler terms), and the zero upperbound of Eq. (|TT)j) is obtained. As previously mentioned, 
Eq. (j!7)) is proved by the fact that the logarithm inequality holds if and only if £ = 1, i.e., 
the Kullbach-Liebler distance between two probability distributions is zero if and only if the 
two distributions are equal. ■ 

V. SOME PARTICULAR CASES 

In this section we consider some particular choices of basic clusters, that is, some partic- 
ular CVM approximations for regular lattices on which several model systems are defined. 

A. "Plaquette" approximations 

By "plaquette" approximations we mean a class of approximations in which basic clusters 
are of a unique type (which we denote as plaquette, for example a square on a square lattice), 
while subclusters with non zero coefficients are only single sites and nearest neighbor pairs. 
Let us denote such clusters by 1 and 2 respectively, and, according to the notation introduced 
in Sec. II, let us denote by ax and a 2 the coefficients of the cluster entropy expansion, by c\ 
and C2 the numbers of plaquettes sharing a given subcluster, and by bi = dijci the normalized 
coefficients. In this class of approximations, it is possible to show that all the coefficients 
can be obtained as a function of ci, C2 and of the lattice coordination number q. Making use 
of Eq. (JTJ), and remembering that basic clusters (plaquettes) have unit a-coefficient, we can 
write 



a\ + qa 2 + Ci = 1, 



a 2 + c 2 



1 



(21) 
(22) 



from which bi = Oi/cf are easily obtained: 



c 2 -l 



(23) 




(24) 



S 



Then, we have to impose the sufficient conditions on the coefficients, Eqs. (pPfj) and (|15|). 
From Eq. (}2*3*|) we easily see that b 2 < 0, which is ok for upperbounding, but usually 61 > 0. 
We then have to couple each site to pairs that contain it and are contained in a given 
plaquette. Let us adopt the strategy of splitting the site coefficient among such pairs in 
equal parts, so that, being 6 2 |i the only allocation coefficient and r the number of pairs, 
Eqs. (JHJ) and JEJ read 

61 = rb 2]1 (25) 
-62 > 2&2[i. (26) 

The allocation coefficient may be easily eliminated, yielding the single condition 

£4<0. (27) 

It is possible to show that also the r parameter depends on Ci,c 2 , q only. Let us imagine 
to multiply the number q of nearest neighbor pairs sharing a site times the number c 2 of 
plaquettes sharing a pair. It is easy to realize that in this way we have overcounted r times 
the number c\ of plaquettes sharing the given site, i.e., 

rci = qc 2 . (28) 

With the above manipulation, the condition ()27|) can be rewritten as 

q{c 2 - 1) < 2(ci - 1). (29) 

In this form we can easily verify its validity, which is done in Tab. H] for a set of typical 
plaquette approximations. We have considered: the 2d square, triangular, and honeycomb 



lattices with a 4-site square 
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a 3-site triangle |29j , and an elementary hexagon as basic 



cluster respectively, the simple cubic (sc) lattice with a 4-site square |29J as basic cluster, anc 
the face-centered cubic (fee) lattice with a 3-site triangle [29] or a 4-site tetrahedron 
as basic cluster. 



B. B and C hierarchies 



The B and C hierarchies, originally proposed by Kikuchi and Brush 30j], are series o 



approximations with increasing cluster size, suitable for 2d square [30] and triangular 



31 



lattices. They are interesting mainly because they converge towards the exact free energy, 
in spite of the fact that the cluster size increases only in one direction. This result has 
been proved rigorously only for the C hierarchy [3], but there are numerical evidences for 

fin n 

both (20, |3Jj. Such results |3J are related to the transfer matrix concept: As the Bethe 
approximation solves exactly an Ising-like chain, the CVM, with infinitely long Id stripes as 
basic clusters (to which the B and C hierarchies tend), solves exactly a 2d lattice. Here we 
are interested in showing that these approximations verify the sufficient condition for the 
convergence discussed above. Let us consider for instance the B hierarchy on the triangular 
lattice (a completely analogous treatment holds for the C hierarchy and/or for the square 
lattice). The basic clusters, shown in Fig. ^ (top row, left column), are made up of a 
sequence of L — 1 up- and L down-pointing triangles, where L is an adjustable parameter. 
Of course, also corresponding clusters with L up- and L — 1 down-pointing triangles are 
allowed, but all basic clusters always extends only in one direction. This choice can be 
viewed as a generalization of the triangle plaquette approximation (see Fig. [TJ top row, right 
column), where of course also up-pointing triangles are included in the set of basic clusters. 
In the following rows of Fig. Q also the subclusters of the given basic cluster, having nonzero 
coefficients in the cluster entropy expansion (a-coefficients), are displayed. They are divided 
in pair-like and site-like subclusters, in that they can be put in one-to-one correspondence 
with pair and site subclusters for the triangle plaquette approximations. Such analogy is 
not only a pictorial one. In fact, it is possible to show (for instance making use of Eq. (pj), 
but see also Ref. j^]) that the a-coefficients are a2 = —1 for pair-like clusters and a\ = 1 
for site-like clusters, like for the triangle plaquette approximation. The same holds for c- 
coefncients, i.e., the numbers of basic clusters sharing a given subclusters, which turn out to 
be C2 = 2 and C\ = 6 respectively, whence 62 — — 1/2 and bi = 1/6. Finally, from Fig. ^one 
easily sees that also the same "allocation" technique as for the plaquette approximation can 
be used. Inside a given basic cluster, each site-like subcluster is shared by r = 2 pair-like 
clusters, and each pair-like cluster contains 2 site-like subclusters, whence inequality (|2Tjl is 
satisfied. 
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C. Hypercube approximation in d dimensions 



Finally, let us consider the case of a hypercubic lattice in d dimensions, and let us choose a 
(i-dimensional hypercube (ci-cube) as basic cluster. Of course, the relevant cases are d = 2, 3, 
the former of which coincides with the square plaquette approximation, mentioned above, 
but the interest of a general treatment will be clearer later. It is possible to show, by repeated 
use of Eq. (0), that clusters with non zero coefficients are only i-cubes, for i — 1, . . . , d, and 
the i-cube coefficient in d dimensions is = (—l) d ~ l . Moreover, the number of <i-cubes 
sharing a given z-cube (in d dimensions) is c[ = 2 d ~ l . As a consequence, the normalized 
coefficients turn out to be 

^ =(—)"■ (30) 



2, 

Let us now impose the sufficient conditions, Eqs. (jUj) and (|I5j). Let us notice that the 
positive coefficients, those who give problems for upperbounding, have the i index with the 

same parity as d, that is i — d — 2, d — 4, Then we can couple each i-cube with (i + 1)- 

cubes that contain it and are contained in a given <i-cube. As for plaquette approximations, 
let us split the i-cube coefficient in equal parts, so that we have a single fr^u allocation 
coefficient. We still have to observe that each i-cube is shared by d—i (z+ l)-cubes contained 
in the same <i-cube (the equivalent of the r parameter for plaquette approximations), and 
that each (i + l)-cube contains 2(i + 1) different i-cubes (the equivalent of 2 sites in a pair). 
We can then rewrite Eqs. ((H)) and ()15j) as 

bf = (d-i) 6« 1|4 (31) 
> 2(* + l)^ ) 1|i . (32) 

By eliminating the allocation coefficient, we obtain 

M (,<«) 



+ (33) 

which, replacing Eq. (j371|) and taking into account that d — i is always even (as previously 
mentioned), becomes 

2i<d-l. (34) 

Such inequality becomes more and more difficult to be satisfied as the subcluster index i 
increases. Therefore we have to consider the worst case, that is i = d — 2, leading to 

d < 3. (35) 
11 



This results essentially proves the convergence for d = 3, because the d = 2 case coincides 
with the square plaquette approximation. Nevertheless, it is mainly interesting in that it 
gives us the opportunity to experiment the natural iteration method in a case in which 
the sufficient condition is not verified. We have actually implemented the procedure for the 
simple Ising model on the d — 4 hypercubic lattice, easily finding cases in which the behavior 
is non convergent (oscillating). This fact lead us to conjecture that actually the sufficient 
condition might be also a necessary one. 

VI. AN EQUIVALENT FORMULATION 

In a recent paper j^j], a general method for the minimization of non convex function- 
al, related to the existence of suitable upperbounds to the functional to be minimized, is 
proposed and applied to the case of the CVM free energy. Different possible choices for the 
upperbounding functional are investigated. Hereafter, we show that one choice proposed 
there, which by the way turns out to be quite convenient in terms of computation time, is 
equivalent to the natural iteration method. First, let us briefly recall the general method, 
which is based on the following. 

Theorem: Let F[p] be a continuous functional in the set of variables p, defined in some 
compact domain Q, and F\p,p'\ an auxiliary continuous functional in a pair of variable sets 
p,p', defined in the domain Q 2 , having a unique minimum with respect to p' for each fixed p. 
Let the auxiliary functional satisfy the following requirements: 



that is, the auxiliary functional is an upperbound to the original functional, and equality 
holds if and only if the two arguments of the former are equal. Then the application tp : p i— > p 
defined by 



F[p'}<F[p,p'] 
F[p'] = F[p,p'] 



p =p, 



(36) 
(37) 



p = argmm F[p,p'] 



p'en 



(38) 



enjoys the properties 



F[p] < F\p] 



(39) 
(40) 




12 



Therefore, it defines an iterative method to minimize the original functional. 
Proof: It is easy to obtain the following inequality chain 



F\p]<F\p,p\<F\p,p\=F\p], 



(41) 



proving immediately Eq. (|39|). The first inequality is the first hypothesis on the auxiliary 
functional F, Eq. (J36j) : the second inequality is a consequence of the definition of <p, Eq. (J38|) : 
the equality descends from the second hypothesis on F, Eq. (|3*7|) . In order to prove also 
Eq. (j4()jl . we have to show that both inequalities hold as equalities if and only if p = p. 
As far as the former is concerned, this is a direct consequence of the hypothesis Eq. (|37jh 
while the latter is proved by the fact that F[p,p'} has a unique minimum, which is also the 
absolute minimum, with respect to p'. ■ 

Let us now consider the auxiliary functional defined by 



First of all, it is easy to see that F[p,p] = F[p], where F\p] is the CVM free energy (JSJ). 
Moreover, F[p, p'} is easily seen to be convex with respect to p', therefore, if it has a stationary 
point, it is also unique, and is a minimum. Finally, let us observe that stationarization of 
this functional with respect to p', with the usual linear constraints, gives rise just to the NIM 
equations 0, which in this way can be used to define the application (p. In order to show 
that if actually perform a minimization of F, a sufficient condition is given by Eqs. ()36j) . ()37j) 
in the above theorem, that is, we have to upperbound the quantity 



with zero. Going back to (the right hand side of) Eq. (|T3"|) . it easily turns out that this is ex- 
actly the same upperbound we have proved with the sufficient condition for the convergence 
of the NIM. 

VII. CONCLUSIONS 

Let us finally summarize our results. We have investigated on the convergence of the 
natural iteration method, proposed by Kikuchi as a minimization procedure for cluster 



F\p,p'\ = J^XX^ M x t) + lo gP 7 (%) + ^b a \ogp 1 (x a ) 



(42) 





(43) 
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variational free energies and widely employed in a lot of applications of the CVM. We have 
discussed a condition on the coefficients of the cluster entropy expansion, which is sufficient 
to prove that the free energy decreases at each iteration, ensuring the convergence of the 
method. Such a condition is based on the idea of pairing subcluster entropies with a positive 
coefficient to larger subcluster terms with a negative coefficient, yielding a set of conditional 
entropy terms with negative coefficients. It had already been proved by Kikuchi in the 
original paper (27] that negative coefficient terms give decreasing contributions to the free 
energy. We have also taken into account a set of common CVM approximations defined on 
various regular lattices, frequently encountered in applications, showing that the sufficient 
condition is always satisfied. In particular, we have devoted some attention to the class of 
hypercube approximations on the generic (d- dimensional) hypercubic lattice, showing that 
the sufficient condition is verified for d < 3. We have also implemented the natural iteration 
method for d = 4 on the simple Ising model, and found out that several (random as well as 
uniform) initial conditions give rise to non convergent (oscillating) behavior. This fact has 
led us to conjecture that the sufficient condition may be also a necessary one. Finally we have 
established a connection with a recently proposed method for the minimization of non-convex 



functionals, which can be applied to the CVM free energy 



Such a method is based on 



the existence of suitable upperbounding functionals to the functional to be minimized. In 



Ref. 



several choices of upperbounding functionals are proposed and applied to simple 
inhomogeneous systems. We have shown that one of the upperbounding choices proposed 
there (indeed quite a good choice in terms of computation time) is actually equivalent to 
Kikuchi's natural iteration method. It turns out explicitly that the upperbounding condition 
implies free energy decreasing, whence convergence. 
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TABLE I: Coefficients for different plaquette approximations. The first two columns report re- 
spectively the lattice and plaquette (basic cluster) type. The following three columns display the 
independent coefficients: q (coordination number), C2,C\ (number of plaquettes sharing a given 
pair, site). The last two columns verify the sufficient condition, in that q(c2 — 1) < 2(ci — 1). 



lattice 


plaquette 


Q 


C2 


Cl 


q(c 2 - 1) 


2(ci - 1) 


square 


square 
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6 


triangular 


triangle 
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10 


honeycomb 


hexagon 


3 
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sc 


square 


6 


4 


12 


18 


22 


fee 


triangle 


12 
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24 


36 


46 


fee 


tetrahedron 


12 


2 


8 


12 


14 
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BASIC CLUSTER PLAQUETTE 

1 3 5 . . . 2L - 1 2L + 1 1 3 

WW V 

2 4 6 ... 2L 2 

PAIR-LIKE CLUSTERS PAIRS 

1 3 5 ... 2L - 1 1 

WW \ 

2 4 6 ... 2L 2 
3 5 . . . 2L - 1 2L + 1 3 

£ZW / 

2 4 6 ... 2L 2 

1 3 5 . . . 2L - 1 2L + 1 1 3 
• • • • • • • 

SITE-LIKE CLUSTERS SITES 

1 3 5 ... 2L - 1 1 
• • • • • 

2 4 6 ... 2L 2 
• • • • • 

3 5 . . . 2L - 1 2L + 1 3 
• • • • • 



FIG. 1: Basic cluster and subclusters for the B hierarchy (left side) and for the corresponding 
(triangle) plaquette approximation. 
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